AITopics | central tendency

Collaborating Authors

central tendency

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Deriving Lehmer and H\"older means as maximum weighted likelihood estimates for the multivariate exponential family

Ziou, Djemel, Fakir, Issam

arXiv.org Artificial IntelligenceMay-1-2024

Consider numerical observations; it is common to calculate their mean and refer to it as central tendency. There are, however, different measures of mean [4]. These measurements are sometimes grouped into families, like Lehmer and Hölder. Distinguishing these measures and better understanding their use involves identifying the link between them and probability density functions (PDFs). For example, the arithmetic mean is the maximum likelihood estimator (MLE) of the position parameter for the normal PDF and the scale parameter for the exponential PDF. For the families of Lehmer and Hölder means, such an interpretation has only recently been proposed for the case of PDFs in the case of the univariate exponential family Let's consider digital observations; it is often common to calculate their mean and designate it as a central tendency. However, there are various measures of the average [2]. These measures are sometimes grouped into families, such as Lehmer and Hölder.

central tendency, exponential family, multivariate exponential family, (12 more...)

arXiv.org Artificial Intelligence

2405.00964

Country:

North America > United States (0.47)
North America > Canada > Quebec > Estrie Region > Sherbrooke (0.04)
North America > Canada > British Columbia (0.04)

Genre: Research Report (0.50)

Industry:

Government (0.48)
Health & Medicine > Therapeutic Area > Psychiatry/Psychology (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.35)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.35)

Add feedback

The Basic Essentials: Statistics For Machine Learning

#artificialintelligenceApr-3-2022, 02:25:42 GMT

Knowledge of statistics is important if you work with data. Having a firm grasp of some fundamental concepts goes a long way in your ability to effectively communicate. You'll also understand the proper methods to collect, analyze, make decisions, and effectively present results that have been discovered from data. In this article, we are going to be using the Breast Cancer Wisconsin dataset from sklearn to cover some fundamental statistics concepts. Below we've imported the necessary frameworks and loaded our data into memory.

dataset, percentile, statistics, (14 more...)

#artificialintelligence

Country: North America > United States > Wisconsin (0.25)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.66)

Add feedback

Driving Style Recognition Using Interval Type-2 Fuzzy Inference System and Multiple Experts Decision Making

Gomes, Iago Pachêco, Wolf, Denis Fernando

arXiv.org Artificial IntelligenceOct-26-2021

Driving styles summarize different driving behaviors that reflect in the movements of the vehicles. These behaviors may indicate a tendency to perform riskier maneuvers, consume more fuel or energy, break traffic rules, or drive carefully. Therefore, this paper presents a driving style recognition using Interval Type-2 Fuzzy Inference System with Multiple Experts Decision-Making for classifying drivers into calm, moderate and aggressive. This system receives as input features longitudinal and lateral kinematic parameters of the vehicle motion. The type-2 fuzzy sets are more robust than type-1 fuzzy sets when handling noisy data, because their membership function are also fuzzy sets. In addition, a multiple experts approach can reduce the bias and imprecision while building the fuzzy rulebase, which stores the knowledge of the fuzzy system. The proposed approach was evaluated using descriptive statistics analysis, and compared with clustering algorithms and a type-1 fuzzy inference system. The results show the tendency to associate lower kinematic profiles for the driving styles classified with the type-2 fuzzy inference system when compared to other algorithms, which is in line with the more conservative approach adopted in the aggregation of the experts' opinions.

fuzzy inference system, membership function, style recognition, (12 more...)

arXiv.org Artificial Intelligence

2110.13805

Country:

South America > Brazil > São Paulo (0.04)
South America > Argentina > Patagonia > Río Negro Province > Viedma (0.04)
North America > United States > Massachusetts (0.04)

Genre: Research Report > New Finding (0.48)

Industry:

Transportation > Ground > Road (1.00)
Automobiles & Trucks (1.00)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Fuzzy Logic (1.00)

Add feedback

Central Tendency: Demystifying "Average" Term

#artificialintelligenceNov-4-2020, 09:00:34 GMT

Skewness indicates whether the data is concentrated on one side. If the histogram is shifted to the left or the right side then the distribution is skewed otherwise, it's not skewed. There are two types of skewness namely right (positive) skew and left (negative) skew. In the right skew case, the 3 measurements will be: mode median mean. Right skew tells us that the outliers are located to the right.

artificial intelligence, central tendency, outlier, (4 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence (0.40)

Add feedback

Arithmetic Mean and Its Applications in Data Analytics

#artificialintelligenceJul-21-2020, 01:06:04 GMT

It's a household name and we use the term every day. And it is arguably the most popular of all statistical terms. Yet, its properties, variants, and versatile use cases are not obvious to many of us. We tend to think that it's just an average of some numbers -- and that's all we need to know. I'm talking about Arithmetic Mean. The purpose of this article is to shed light on little-explored areas of the arithmetic mean including its properties, use cases, and limitations.

arithmetic mean, artificial intelligence, use case, (7 more...)

#artificialintelligence

Technology:

Information Technology > Data Science (0.40)
Information Technology > Artificial Intelligence (0.40)

Add feedback

A Hierarchy of Limitations in Machine Learning

Malik, Momin M.

arXiv.org Machine LearningFeb-29-2020

There is little argument about whether or not machine learning models are useful for applying to social systems. But if we take seriously George Box's dictum, or indeed the even older one that "the map is not the territory' (Korzybski, 1933), then there has been comparatively less systematic attention paid within the field to how machine learning models are wrong (Selbst et al., 2019) and seeing possible harms in that light. By "wrong" I do not mean in terms of making misclassifications, or even fitting over the'wrong' class of functions, but more fundamental mathematical/statistical assumptions, philosophical (in the sense used by Abbott, 1988) commitments about how we represent the world, and sociological processes of how models interact with target phenomena. This paper takes a particular model of machine learning research or application: one that its creators and deployers think provides a reliable way of interacting with the social world (whether that is through understanding, or in making predictions) without any intent to cause harm (McQuillan, 2018) and, in fact, a desire to not cause harm and instead improve the world, 1 for example as most explicitly in the various "{Data [Science], Machine Learning, Artificial Intelligence} for [Social] Good" initiatives, and more widely in framings around "fairness" or "ethics." I focus on the almost entirely statistical modern version of machine learning, rather than eclipsed older visions (see section 3). While many of the limitations I discuss apply to the use of machine learning in any domain, I focus on applications to the social world in order to explore the domain where limitations are strongest and stickiest.

correlation, machine learning, prediction, (15 more...)

arXiv.org Machine Learning

2002.05193

Country:

Oceania > Australia (0.14)
North America > United States > California (0.14)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
(19 more...)

Genre:

Research Report > Experimental Study (1.00)
Overview (1.00)
Research Report > New Finding (0.92)

Industry:

Law > Civil Rights & Constitutional Law (1.00)
Information Technology (1.00)
Health & Medicine > Therapeutic Area > Oncology (1.00)
(14 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.92)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.67)

Add feedback

Arithmetic, Geometric, and Harmonic Means for Machine Learning

#artificialintelligenceDec-17-2019, 17:43:09 GMT

Calculating the average of a variable or a list of numbers is a common operation in machine learning. It is an operation you may use every day either directly, such as when summarizing data, or indirectly, such as a smaller step in a larger procedure when fitting a model. The average is a synonym for the mean, a number that represents the most likely value from a probability distribution. As such, there are multiple different ways to calculate the mean based on the type of data that you're working with. This can trip you up if you use the wrong mean for your data.

arithmetic mean, geometric mean, harmonic mean, (9 more...)

#artificialintelligence

Country: Asia > Philippines > Luzon > National Capital Region > City of Manila (0.05)

Genre: Instructional Material (0.49)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.99)

Add feedback

A Gentle Introduction to Calculating Normal Summary Statistics

#artificialintelligenceMay-4-2018, 19:26:42 GMT

A sample of data is a snapshot from a broader population of all possible observations that could be taken of a domain or generated by a process. Interestingly, many observations fit a common pattern or distribution called the normal distribution, or more formally, the Gaussian distribution. A lot is known about the Gaussian distribution, and as such, there are whole sub-fields of statistics and statistical methods that can be used with Gaussian data. In this tutorial, you will discover the Gaussian distribution, how to identify it, and how to calculate key summary statistics of data drawn from this distribution. A Gentle Introduction to Calculating Normal Summary Statistics Photo by John, some rights reserved.

artificial intelligence, gaussian distribution, machine learning, (16 more...)

#artificialintelligence

Genre: Instructional Material > Course Syllabus & Notes (0.34)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.91)

Add feedback

Domino habits for data science

@machinelearnbotApr-9-2018, 00:15:13 GMT

Inculcating discipline [Understanding business justification] – Explore and document'why' your data is there? What are the technical systems / business processes that generated this data? Have you talked to people who decided to log the data? Staying grounded and staying updated - Did you revisit the concepts and did a read-up of the best practices (again)? Have you checked the math?

data science, domino habit, learning, (9 more...)

@machinelearnbot

Technology:

Information Technology > Data Science (0.71)
Information Technology > Artificial Intelligence > Machine Learning (0.39)

Add feedback

Identifying Consistent Statements about Numerical Data with Dispersion-Corrected Subgroup Discovery

Boley, Mario, Goldsmith, Bryan R., Ghiringhelli, Luca M., Vreeken, Jilles

arXiv.org Artificial IntelligenceApr-23-2017

Existing algorithms for subgroup discovery with numerical targets do not optimize the error or target variable dispersion of the groups they find. This often leads to unreliable or inconsistent statements about the data, rendering practical applications, especially in scientific domains, futile. Therefore, we here extend the optimistic estimator framework for optimal subgroup discovery to a new class of objective functions: we show how tight estimators can be computed efficiently for all functions that are determined by subgroup size (non-decreasing dependence), the subgroup median value, and a dispersion measure around the median (non-increasing dependence). In the important special case when dispersion is measured using the average absolute deviation from the median, this novel approach yields a linear time algorithm. Empirical evaluation on a wide range of datasets shows that, when used within branch-and-bound search, this approach is highly efficient and indeed discovers subgroups with much smaller errors.

artificial intelligence, objective function, optimistic estimator, (16 more...)

arXiv.org Artificial Intelligence

doi: 10.1007/s10618-017-0520-3

1701.07696

Country: Europe > Germany (0.46)

Genre: Research Report (1.00)

Industry: Health & Medicine (0.46)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)

Add feedback